Visual Intelligence: Toward Machine Understanding of Video Content

نویسندگان

  • Michael C. Burl
  • Russell L. Knight
  • Anthony C. Barrett
چکیده

This paper describes progress toward developing visual intelligence algorithms (VI) that can produce humanlike text descriptions (captions) from video inputs. Video frames are assumed to be generated according to an underlying “script’ that specifies a camera model and the content and action in a scene. VI is formulated as the problem of recovering the script (or relevant portions of the script) given a sequence of video frames. Three types of scripts at different levels of abstraction are recovered: C-scripts contain object detections, poses, and descriptive information on a frame-by-frame basis; B-scripts assign persistent IDs to objects across frames and “smooth” frame-by-frame information; A-scripts provide a symbolic representation of video content using a sparse timeline in which Planning Executing Agent (PEA) graphical models (behavior snippets) are associated with agents in the scene. From the script representations, a compact text description (caption) of the action in the scene, as well as an envisionment (3D rendering) showing what the algorithm believes happened, can be generated. Scripts have been derived automatically and evaluated on a set of 240 publicly available video vignettes containing over 100,000 frames.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Machine Learning Approach to No-Reference Objective Video Quality Assessment for High Definition Resources

The video quality assessment must be adapted to the human visual system, which is why researchers have performed subjective viewing experiments in order to obtain the conditions of encoding of video systems to provide the best quality to the user. The objective of this study is to assess the video quality using image features extraction without using reference video. RMSE values and processing ...

متن کامل

Learning Social Relations from Videos: Features, Models, and Analytics

Despite the progress made during the last decade in video understanding, extracting high-level semantics in the form of relations among the actors in a video is still an under-explored area. This chapter discusses a streamlined methodology to learn interactions between actors, construct social networks, identify communities, and find the leader of each community in a video sequence from a socio...

متن کامل

Learning from narrated instruction videos

Automatic assistants could guide a person or a robot in performing new tasks, such as changing a car tire or repotting a plant. Creating such assistants, however, is non-trivial and requires understanding of visual and verbal content of a video. Towards this goal, we here address the problem of automatically learning the main steps of a task from a set of narrated instruction videos. We develop...

متن کامل

A Novel Approach to Background Subtraction Using Visual Saliency Map

Generally human vision system searches for salient regions and movements in video scenes to lessen the search space and effort. Using visual saliency map for modelling gives important information for understanding in many applications. In this paper we present a simple method with low computation load using visual saliency map for background subtraction in video stream. The proposed technique i...

متن کامل

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015